Outlier Detection Using SemiDiscrete Decomposition

نویسنده

  • S. McConnell
چکیده

Semidiscrete decomposition (SDD) is usually presented as a storage-eecient analogue of singular value decomposition. We show, however, that SDD actually works in a completely diierent way, and is best thought of as a bump-hunting technique; it is extremely eeective at nding outlier clusters in datasets. We suggest that SDD's success in text retrieval applications such as latent semantic indexing is fortuitous, and occurs because such datasets typically contain a large number of small clusters. Abstract: Semidiscrete decomposition (SDD) is usually presented as a storage-eecient analogue of singular value decomposition. We show, however, that SDD actually works in a completely diierent way, and is best thought of as a bump-hunting technique; it is extremely eeective at nding outlier clusters in datasets. We suggest that SDD's success in text retrieval applications such as latent semantic indexing is fortuitous, and occurs because such datasets typically contain a large number of small clusters.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Outlier Detection in Wireless Sensor Networks Using Distributed Principal Component Analysis

Detecting anomalies is an important challenge for intrusion detection and fault diagnosis in wireless sensor networks (WSNs). To address the problem of outlier detection in wireless sensor networks, in this paper we present a PCA-based centralized approach and a DPCA-based distributed energy-efficient approach for detecting outliers in sensed data in a WSN. The outliers in sensed data can be ca...

متن کامل

Shape Outlier Detection Using Pose Preserving Dynamic Shape Models

In this paper, we introduce a framework for shape outlier, like carrying object, detection in different people from different views using pose preserving dynamic shape models. We model dynamic human shape deformations in different people using kinematics manifold embedding and decomposition of nonlinear mapping using kernel map and multilinear analysis. The generative model supports pose-preser...

متن کامل

Outlier Detection Using Extreme Learning Machines Based on Quantum Fuzzy C-Means

One of the most important concerns of a data miner is always to have accurate and error-free data. Data that does not contain human errors and whose records are full and contain correct data. In this paper, a new learning model based on an extreme learning machine neural network is proposed for outlier detection. The function of neural networks depends on various parameters such as the structur...

متن کامل

Condition Monitoring of a Wind Turbine Gearbox using the Empirical Mode Decomposition Method and Outlier Analysis

Wind turbines are subject to variable aerodynamic loads and extreme environmental conditions. Wind turbine components fail frequently, resulting in high maintenance costs. For this reason, gearbox condition monitoring becomes important since gearboxes are among the wind turbine components with the most frequent failure observations. The major challenge here is the detection of faults under the ...

متن کامل

Local Outlier Detection with Interpretation

Outlier detection aims at searching for a small set of objects that are inconsistent or considerably deviating from other objects in a dataset. Existing research focuses on outlier identification while omitting the equally important problem of outlier interpretation. This paper presents a novel method named LODI to address both problems at the same time. In LODI, we develop an approach that exp...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002